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The invention has for object the new recombinant prolyl-dipeptidyl-peptidase enzyme (DPP IV) from Aspergillus oryzae comprising 
the amino-acid sequence from amino acid 1 to amino acid 755 of SEQ ID NO:2 or functional derivatives thereof, and providing a high 
level of hydrolysing specificity towards proteins and peptides starting with X-Pro- thus liberating dipeptides of X-Pro type, wherein X 
is any amino acid. The invention also provides a DNA molecule encoding the enzyme according to the invention, cells expressing the 
enzyme according to the invention by recombinant technology, an Aspergillus naturally providing a prolyl-dipeptidyl-peptidase activity 
which has integrated multiple copies of the Aspergillus native promoter which naturally directs the expression of the gene encoding the 
prolyl-dipeptidyl-peptidase activity, Aspergillus naturally providing a prolyl-dipeptidyl-peptidase activity which is manipulated genetically 
so that the dppW gene is inactivated. The invention provides a method for producing the enzyme according to the invention, comprising 
cultivating the cells of the invention in a suitable growth medium under conditions that the cells express the enzyme, and optionally 
isolating the enzyme in the form of a concentrate. The invention provides the use of the enzyme or the cells of the invention to hydrolyse 
protein containing materials. The invention provides the use of an enzyme and/or a cell providing a prolyl-dipeptidyl-peptidase activity, 
in combination with at least an enzyme providing a prolidase to hydrolyse protein containing materials. In a last further aspect, the 
invention provides a food product comprising a protein hydrolysate obtainable by fermentation with at least a microorganism providing a 
prolyl-dipeptidyl-peptidase activity higher than 50 mU per ml when grown in a minimal medium containing 1 % (w/v) of wheat gluten. 
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Cloning of the prolyl-dipeptidyl-peptidase from Aspergillus oryzae 

The present invention relates to a new recombinant prolyl-dipeptidyl-peptidase 
from Aspergillus oryzae, a gene encoding this enzyme, recombinant cells 
5 expressing this enzyme, and methods for hydrolysing protein containing materials. 

State pf the ftrt 

Hydrolysed proteins, which are widely used in the food industry, may be prepared 
10 by hydrolysis of protein material with acid, alkali or enzymes. However, on the 
one hand, acid or alkaline hydrolysis can destroy the essential amino acids 
produced during hydrolysis thus reducing the nutritional value, whereas enzymatic 
hydrolysis rarely goes to completion so that the hydrolysed protein contains 
substantial amounts of peptides. 

15 

The filamentous ascomycete Aspergillus oryzae is known to secrete a large variety 
of amylases, proteinases and peptidases, the action of which are essential for the 
efficient solubilisation and hydrolysis of raw materials (see WO94/25580). 
Various methods have been used Aspergillus oryzae for the preparation of food 
20 products, especially methods involving the use of a koji culture. 

EP4 17481 (Nestle) thus describes a process for the production of a fermented soya 
sauce, in which a koji is prepared by mixing an Aspergillus oryzae koji culture 
with a mixture of cooked soya and roasted wheat, the koji is then hydrolysed in 
25 aqueous suspension for 3 to 8 hours at 45°C to 60°C with the enzymes produced 
during fermentation of the Aspergillus oryzae koji culture, a moromi is further 
prepared by adding sodium chloride to the hydrolysed koji suspension, the 
moromi is left to ferment and is then pressed and the liquor obtained is pasteurized 
and clarified. 

30 

EP429760 (Nestle) describes a process for the production of a flavouring agent in 
which an aqueous suspension of a protein-rich material is prepared, the proteins 
are solubilized by hydrolysis of the suspension with a protease at pH6.0 to 1 1.0, 
the suspension is heat-trated at pH 4.6 to 6.5, and the suspension is ripened with 
35 enzymes of a koji culture fermented by Aspergillus oryzae. 
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Likewise, EP96201 923.8 (Nestle) describes a process for the production of a 
meat flavour, in which a mixture containing a vegetal proteinaceous source and a 
vegetale carbohydrates containing source is prepared, said mixture having intially 
at least 45% dry matter, the mixture is inoculated with a koji culture fermented by 
5 Aspergillus oryzae and by one or more another species of microorganisms 
involved in the traditional fermentation of meat, and the mixture is incubated until 
meat flavours are formed. 

Depending on the nature of the protein and the enzymes used for proteolysis, the 
10 peptides formed can however have extremely bitter tastes and are thus 
organoleptically undesirable. There is hence a need for methods of hydrolysing 
proteins leading to high degree of protein hydrolysis and to hydrolysates with 
excellent organoleptic properties. 

15 In addition, in protein rich materials subjected to enzymatic hydrolysis, a high 
level of glutaminase is required to convert glutamine into glutamic acid which is 
an important natural taste enhancer (see W095/31114). Biochemical analysis of 
residual peptides in cereals hydrolysed by Aspergillus oryzae, i.e. wheat gluten, 
shows however that a considerable amount of glutamine remains sequestered in 

20 proline containing peptides (Adler-Nissen, In: Enzymatic hydrolysis of food 
proteins. Elsevier Applied Sciences Publishers LTD, pl20, 1986). There is hence 
a need for methods of hydrolysing proteins leading to liberation of high amount of 
glutamine. 

25 Among the different proteases known from koji molds, two neutral endopeptidase 
(Nakadai et al % Agric. Biol. Chem., 17, 2695-2708, 1973), an alkaline 
endopeptidase (Nakadai et ai, Agric. Biol. Chem., 12, 2685-2694, 1973), an 
aspartic protease (Tsujita et ai, Biochem. Biophys Acta, 445 , 194-204, 1976), 
several aminopeptidases (Ozawa et ai y Agric. Biol. Chem., 12, 1285-1293, 1973), 

30 several carboxypeptidases (Nakadai et ai, Agric. Biol. Chem., 12, 1237-1251, 
1970) have been identified and purified. 

More recently a prolyl-dipeptidyl-peptidase activity has been detected in 
Aspergillus oryzae, which is an enzyme providing a high level of hydrolysing 
35 specificity towards proteins and peptides starting with X-Pro- thus liberating 
dipeptides of X-Pro type, wherein X is any amino-acid (Tachi et ai; 
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Phytochemistry, 21, 3707-3709, 1992). 
Summary of the invention 

The present invention has for object the new recombinant prolyl-dipeptidyl- 
peptidase (DPP IV) from Aspergillus oryzae comprising the amino-acid sequence 
from amino acid 1 to amino acid 755 of SEQ ID NO:2 or functional derivatives 
thereof. 

In a second aspect, the invention also provides a DNA molecule encoding the 
enzyme according to the invention. 

In a third aspect, the invention provides a cell expressing the enzyme according to 
the invention by recombinant technology. 

In a fourth aspect, the invention provides an Aspergillus naturally providing a 
prolyl-dipeptidyl-peptidase activity which has integrated multiple copies of the 
Aspergillus native promoter which naturally directs the expression of the gene 
encoding the prolyl-dipeptidyl-peptidase activity. 

In a fifth aspect, the invention provides an Aspergillus naturally providing a 
prolyl-dipeptidyl-peptidase activity which is manipulated genetically so that the 
dpplV gene is inactivated. 

In a sixth aspect, the invention provides a method for producing the enzyme 
according to the invention, comprising cultivating the cells of the invention in a 
suitable growth medium under conditions that the cells express the enzyme, and 
optionally isolating the enzyme in the form of a concentrate. 

In a seventh aspect, the invention provides the use of the enzyme or the cells of 
the invention to hydrolyse protein containing materials. 

In another aspect, the invention provides the use of an enzyme and/or a cell 
providing a prolyl-dipeptidyl-peptidase activity, in combination with at least an 
enzyme providing a prolidase to hydrolyse protein containing materials. 
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In a last further aspect, the invention provides a food product comprising a protein 
hydrolysate obtainable by fermentation with at least a microoganism providing a 
prolyl-dipeptidyl-peptidase activity higher than 50 mU per ml when grown in a 
minimal medium containing 1 % (w/v) of wheat gluten 

5 

Detailed description of the invention 

Within the following description, the percentages are given by weight except 
where otherwise stated, and the amino acid or nucleotide sequences referred as 
10 "SEQ ID NO:" are always presented in the sequence listing hereafter. 

Likewise, the expression "functional derivative of an enzyme" includes all amino 
acid sequences which differ by substitution, deletion, addition of some amino 
acids, for instance 1-20 amino acids, but which keep their original activities or 

15 functions. The selection of a functional derivative is considered to be obvious to 
one skilled in the art, since one may easily creates variants of the DPP IV (having 
the amino acid sequence SEQ ID NO:2) by slightly adapting methods known to 
one skilled in the art, for instance the methods described by Adams et al. 
(EP402450; Genencor), by Dunn et al. (Protein Engineering, 2, 283-291, 1988), 

20 by Greener et al. (Strategies, 2, 32-34, 1994), and/or by Deng et al. (Anal. 
Biochem, 200,81, 1992). 

In particular, a protein may be generally considered as a derivative to another 
protein, if its sequence is at least 80% identical to the protein, preferably at least 
25 90%, in particular 95%. In the context of the present disclosure, the identity is 
determined by the ratio between the number of amino acids of a derivative 
sequence which are identical to those of the DPP IV having the amino acid 
sequence SEQ ID NO:2 (mature sequence 1-755), and the total number of or 
amino acids of the said derivative sequence. 

30 

In addition, the term "koji" designates the product of the fermentation with a koji 
mold culture of a mixture of a source of proteins and a source of carbohydrates, 
especially of a mixture of a leguminous plant or of a cooked oleagginous plant and 
of a cooked or roasted cereal source, for example of a mixture of soya or cooked 
35 beans and of cooked or roasted wheat or rice. 
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The present invention thus concerns the new prolyl-dipeptidyl-peptidase enzyme 
originating from Aspergillus oryzae which comprises the amino-acid sequence 
from amino acid 1 to 755 of SEQ ID NO:2 or functional derivatives thereof This 
enzyme may be operably fused to a leader peptide faciliting its secretion in a host 
5 where the enzyme is expressed, for example the Aspergillus oryzae leader peptide 
having the amino-acid sequence from amino acid -16 to -1 of SEQ ID NO:2 or 
functional derivatives thereof. 

A dppYV gene encoding the DPP IV according to the invention may at least 
1 0 comprise the coding parts of the nucleotide sequence SEQ ID NO: 1 , or functional 
derivatives thereof due to the degeneracy of the genetic code. This sequence is in 
fact interrupted by a non-coding sequence, called intron, that is spliced during in- 
vivo transcription (exon I at 1836-1841 bp; exon II at 1925-1924 bp; intron at 
1842-1924 bp). 

15 

A dppTV gene may be obtained in substantially purified form by using the method 
described within the following examples from any strain of Aspergillus oryzae. 
Alternatively, a dpplV gene may be (1) detected also from other genera or species 
of microoganisms by use of DNA probes derived from the nucleotide sequence 
20 SEQ ID NO: l in a stringent hybridization assay, and (2) recovered by the well 
known Reverse-PCR method by use of suitable primers, for example primers SEQ 
ID NO:8 and 9. In a further aspect, a dpplV gene may also be in-vitro synthesized 
and then multiplied by using the polymerase chain reaction, for instance. 

25 The DNA molecule according to the invention at least comprises a dpplV gene 
encoding the DPP IV of the invention. This molecule may be in a form of a 
vector, i.e. a replicative plasmid or an integrative circular or linearized non 
replicative plasmid. The DNA molecule thus may comprise, operably linked to the 
dppTV gene, regulatory sequences native to the organism from which derives the 

30 gene. Said native regulatory sequences may be the promoter, the terminator, 
and/or a DNA sequence encoding a signal sequence that originally regulated the 
secretion of the dppW gene, such as the Aspergillus orzyzae nucleotide sequence 
• coding for a signal peptide from nucleotide 1836 to nucleotide 1966 of SEQ ID 
NO:l (without the intron) or functional derivatives thereof due to the degeneracy 

35 of the genetic code. In another embodiment, regulatory sequences may be native 
sequences that regulate a different gene in the said organism of origin or that 
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regulate a different gene in a foreign organism, for example. A regulatory 
sequence other than the native regulatory sequence will generally be selected for 
its high efficiency or desirable characteristic, for example inducibility of a 
promoter or a sequence encoding a peptide signal which will permit secretion of 
5 the protein. 

If heterologous expression is preferred, meaning that the genes of the invention 
are expressed in another organism than the original host (strain, variety, species, 
genus, family, order, class or division) the regulatory sequences are preferably 

10 derived from an organism similar or equal to the expression host. For example, if 
the expression host is a yeast cell, then the regulatory sequences will be derived 
from a yeast cell. The promoter suitable for constitutive expression, preferably in 
a fungal host, may be a promoter from the following genes: glycerolaldhehyde-3- 
phosphate dehydrogenase, phospho-glycerate kinase, triose phosphate isomerase 

15 and acetamidase, for example.. Promoter suitable for inducible expression, pre- 
ferably in a fungal host, may be a promoter from the following genes: 
endoxylanase IIA, glucoamylase A, cellobiosehydrolase, amylase, invertase, 
alcohol dehydrogenase and amyloglucosidase. The selection of a desirable 
regulatory sequence operably linked to a sequence of the invention and capable of 

20 directing the expression of the said nucleotide sequence is considered to be 
obvious to one skilled in the art. 

The DNA molecule according to the invention may also comprise a selection 
marker to discriminate host cells into which the recombinant DNA material has 

25 been introduced from cells that do not comprise the said recombinant material. 
Such marker genes are, for example in case fungal expression is preferred, the 
known ga-2, pyrG, pyrA, pyrK trpC, amdS or argB genes. The DNA molecule 
may also comprise at least one suitable replication origin. Suitable transformation 
methods and suitable expression vectors provided with a suitable transcription 

30 promoter, suitable transcription termination signals and suitable marker genes for 
selecting transformed cells are already known in the literature for many organisms 
including different bacteria, fungal and plant species. In the event fungal 
expression is required, the expression system described in EP278355 (Novartis) 
may be thus particularly adapted. 

35 

Recombinant koji molds may be obtained by any method enabling a foreign DNA 
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to be introduced into a cell. Such methods include transformation, electroporation, 
or any other technique known to those skilled in the art. 

The invention thus encompasses a recombinant cell comprising the DNA 
5 molecule of the invention, the said cell being able to express the DPP IV of the 
invention or functional derivatives thereof. These cells may be derived from the 
group of fungal, yeast, bacterial and plant cells. Preferably, yeast cells are of the 
genera Saccharomyces, Kluyveromyces, Hansenula and Pichia, bacterial cells are 
Gram negative or positive bacteria, i.e. of the genera Escherichia, Bacillus, 

10 Lactobacillus, Lactococcus, Streptococcus and Staphylococcus, plant cells are of 
the vegetable group, and fungal cells are cells that are tradionnally used for 
making a koji, such as Aspergillus, Rhizopus and/or Mucor species, notably 
Aspergillus soyae, Aspergillus oryzae (ATCC 20386), Aspergillus phoenicis 
(ATCC 14332), Aspergillus niger (ATCC 1004), Aspergillus awamori (ATCC 

15 14331), Rhizopus oryzae (ATCC 4858), Rhizopus oligosporus (ATCC 22959), 
Rhizopus japonicus (ATCC 8466), Rhizopus formosaensis, Mucor circinelloides 
(ATCC 15242), Mucor japanicus, Penicillium glaucum and Penicillium fuscum 
(ATCC 10447). Strains referred by an ATCC number are accessible at the 
American Type Culture Collection, Rockville, Maryland 20852, US. The 

20 invention is not limited by such indications which were rather give to enable one 
skilled in the art to carry out the invention. 

Recombinant cells of the invention may comprise the DNA molecule of the 
invention stably integrated into the chromosome or on a replicative plasmid. 
25 Among all recombinant cells of the invention thus created, the present invention 
has particularly for object the strains A. oryzae CNCM 1-1887, A. oryzae CNCM I- 
1888 and Pichia pastoris CNCM 1-1886. 

Preferably, functional copies of the dppW gene are integrated at a predefined 
30 locus of the chromosomal DNA of the host cell. 

Accordingly, in order to operably integrate into the chromosome of prokaryotic 
cells at least one functional dpplV gene which is not fused to any promoter, the 
DNA molecule of the invention may be integrated by using the process described 
35 in EP564966, i.e., 
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(1) transforming a host strain organism with a donor plasmid which does not 
replicate in the host strain, wherein the donor plasmid comprises a vector 
backbone and a dpplY gene of the invention operably integrated, without any 
promoter, into a part of an operon of the host strain, maintaining the frame and the 
5 function of the genomic operon of the host strain; (2) identifying cointegrate 
transformants in which the complete donor plasmid is integrated into the genomic 
operon of the host strain; and (3) selecting an integrant transformant from the 
cointegrate transformants, wherein the genome of the selected integrant 
transformant does not include the vector backbone of the donor plasmid but does 
10 include the dppW gene, which is operably integrated into the conserved genomic 
operon and which is stably maintained and expressed due to selective pressure on 
the correct functioning of the essential cistron upon growth in a standard medium. 

In a second embodiment, in order to stably integrate into the chromosome of 
15 eucaryotic cells only one functional dppIW sequence which is fused to a promoter 
and a terminator which are native to the host organism, DNA molecule of the 
invention may be integrated by sligthly adapting the process of de Ruiter- Jacobs, 
Y.M.J.T., Broekhuijsen et al (A gene transfer system based on the homologous 
pyrG gene and efficient expression of bacterial genes in Aspergillus oryzae. Curr. 
20 Genet. 16: 159-163, 1989), i.e., 

(1) preparing a non-replicative DNA fragment by ligating the dppW, which is 
operably linked to a promoter and terminator that are native to the host organism, 
downstream a DNA sequence encoding any essential gene, said essential gene 

25 being inactivated by at least a mutation and/or a deletion (this essential gene may 
be a gene involved in uracil biosynthesis, such as the pyrG gene in case A. oryzae 
is used, for example); (2) selecting a host organism containing the essential gene 
which is however inactivated by another mutation(s) or deletion(s); (3) 
transforming said host organism with the non-replicative DNA fragment; (4) 

30 identifying integrate transformants in which the DNA fragment is integrated so as 
to restaure the native function of the essential, gene; (5) selecting an integrate 
transformant in which only one DNA fragment is integrated. 

Progeny of an expression host comprising a DNA molecule according to the 
35 invention is also included in the present invention. Accordingly, a preferred 
embodiment of the invention is directed to a cell comprising a recombinant DNA 
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molecule of the invention in any of the embodiments described above, wherein the 
said cell is able to integrate the DPP IV into the cell wall or the cell membrane or 
secrete the enzymes into the periplasmic space or into the culture medium. The 
secreting route to be followed by the recombinant protein according to the in- 
5 vention will depend on the selected host cell and the composition of the 
recombinant DNA according to the invention. Most preferably, however, the 
protein will be secreted into the culture medium. To this end, the cell according to 
the invention may comprise a recombinant dppTV gene further operably linked to 
a DNA encoding a foreign leader sequence (pre or prepro), for example. 

10 

Cells over-expressing the DPP IV of the invention are preferably choosen, 
especially Aspergillus cells capable of providing at least 50 mU, especially at least 
100 mU, of DPP IV activity per ml of supernatant when grown in a minimal 
medium containing 1 % (w/v) of wheat gluten, such as the MMWG medium. 

15 

These cells may be obtained by incorporation of the DNA molecule of the present 
invention in an expression host, said DNA molecule comprising one or more 
regulatory sequences which serve to increase expression levels of the protein(s) of 
the invention. 

20 

The over-expression can be further achieved by introducing multicopies of the 
DNA molecule of the invention, for example. Surprisingly, Aspergillus cells 
having integrated multiple recombinant functional dppWf genes of the invention 
may provide a DPP IV activity per ml of supernatant which is more than it should 
25 have been compared to the number of integrated copies, probably due to the 
titration of a negatively acting transcription factor. As an example, the Aspergilus 
oryzae tranformant 6 of the following example 1 was deposited under the 
Budapest Treaty at the CNCM where it receives the deposit number CNCM I- 
1888. 

30 

In addition, it has also been shown that over-expression of the DPP IV may be 
achieved in Aspergillus species naturally providing a prolyl-dipeptidyl-peptidase 
activity, by integrating multiple copies of the Aspergillus native promoter which 
naturally directs the expression of the gene encoding the prolyl-dipeptidyl- 
35 peptidase activity. The promoter region of Aspergillus oryzae contained in the 
nucleotide sequence from nucleotide 1 to nucleotide 1835 of SEQ ID NO:l is of 
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particular interest for this purpose. As an example, the Aspergilus oryzae 
transformant B2 of the following example 4 was deposited under the Budapest 
Treaty at the CNCM where it receives the deposit number CNCM 1-1887. 

5 The invention is also directed to a process for producing the DPP IV of the 
invention comprising, providing recombinant cells according to the invention in a 
suitable growth medium under conditions that the cells express the DPP IV, and 
optionally isolating the said recombinant protein(s) in the form of a concentrate. 
The selection of the appropriate medium may be based on the choice of expression 
10 host and/or based on the regulatory requirements of the DNA recombinant 
material. Such media are well-known to those skilled in the art. 

After fermentation, the cells can be removed from the fermentation broth by 
centrifugation or filtration. Depending on whether the host cells have secreted the 

15 DPP IV of the invention into the medium or whether the DPP IV are still 
connected to the host cells in some way either in the cytoplasm, in the periplasmic 
space or attached to or in the membrane or cell wall, the cells can undergo further 
treatment to obtain the recombinant protein. In the latter case, where the 
recombinant enzyme is still connected to the cells, recovery may be accomplished 

20 by rupturing the cells for example by high pressure, sonication, enzymatic 
digestion or simply by cell autolysis followed by subsequent isolation of the 
desired product. The DPP IV can be separated from the cell mass by various 
methods, such as ultrafiltration, and then subsequently precipitated with an 
organic solvent. The isolated DPP IV may be further purified by conventional 

25 methods such as precipitation and/or chromatography. 

The present invention also relates to the use of the: purified DPP IV or the above 
mentioned cells to hydrolyse protein containing materials, such as mixtures of a 
source of proteins and a source of carbohydrates, especially of a mixture of a 

30 leguminous plant or of a cooked oleaginous plant and of a cooked or roasted 
cereal source, for example of a mixture of soya or cooked beans and of cooked or 
roasted wheat or rice. Compositions containing wheat gluten are particularly 
adapted for the purpose of the present invention, since considerable amount of 
glutamine remains sequestered in proline containing peptides when wheat gluten 

35 is hydrolysed by traditional koji cultures. 



WO 99/02705 PCT/EP98/02799 

11 



To obtain a satisfactory degree of hydrolysis, the purified DPP IV may suitably be 
added to the proteinaceous material in a amount of 0.05-15 Unit/ 100 g of protein, 
in particular 0.1-8 Unit/1 OOg of protein. The incubation may be performed at a pH 
from between about 4 and about 10, preferably between about 5 and about 9. The 
5 incubation may be performed at any convenient temperature at which the enzyme 
preparation does not become inactivated, i.e. in the range of from about 20°C to 
about 70°C. 

In addition, in the event one may try, after or during hydrolysis with DPPIV, to 
10 further liberate as much as possible glutamine linked to proline residues, the 
present invention provides a method in which the DPP IV of the invention is used 
in combination with at least an enzyme providing a prolidase activity that is to say 
an enzyme which has a high level of specificity towards dipeptides of the X-Pro 
type (Ezespla et aL, Ap. Env. Microb., £2, 314-316, 1997; Such kind of enzyme is 
15 already available from Sigma: E.C. 3.4.13.9). 

In a further aspect, the present invention relates to a food product comprising a 
protein hydrolysate obtainable by fermentation with at least a microoganism 
providing a prolyl-dipeptidyl-peptidase activity higher than 50 mU per ml when 
20 grown in a minimal medium containing 1 % (w/v) of wheat gluten. 

Important food products of the present invention is an ingredient of a mother milk 
substitute for infants, or a hydrolysed vegetable protein ingredient, i.e. a koji. 
Indeed, if the DPP IV activity (enzyme or microoganism) is combined with other 

25 proteolytic activities (enzymes or microoganisms), i.e. typically if Pichia pastoris 
CNCM 1-1886 or Aspergillus oryzae CNCM 1-1887 or CNCM M888 or enzyme 
purificates thereof are used, high degree of hydrolysis may be obtained leading to 
a non-bitter flavour and a significantly lower allergenicity than unhydrolysed 
proteins. The milk substitute may be further formulated in substantially, the same 

30 way as that indicated in the prior literature for products of this type (cf. EP 
96202475.8). 

The present invention is not to be limited in scope by the specific embodiments 
described herein. Indeed, various modifications of the invention, in addition to 
35 those described herein, will become apparent to those skilled in the art from the 
foregoing description and accompanying figures. Such modifications are intended 
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to fall within the scope of the claims. Various publications are cited herein, the 
disclosures of which are incorporated by reference in their entireties to tlie extent 
necessary for understanding the present invention. DNA manipulation, cloning 
and transformation of bacteria cells are, except where otherwise stated, carried out 
5 according to the textbook of Sambrook et al (Sambrook et al.., Molecular 
Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, U.S.A., 
1989). These examples are preceded by a brief description of the plasmids and 
strains used, and by the composition of various media. The strains A, oryzae TK3, 
A. oryzae transformant 6 (example 1), A. oryzae transformant B2 (example 4), 

1 0 Pichia pastoris containing pKJl 1 5 (example 3) were deposited under the Budapest 
Treaty, at the Collection Nationale de Culture de Microorganismes (CNCM), 25 
rue du docteur Roux, 75724 Paris, France, on June 24, 1997, where they receive 
respectively the deposit numbers CNCM 1-1882, CNCM 1-1888, CNCM 1-1887 
and CNCM 1-1886. All restrictions as to the availability of these deposits will be 

1 5 withdrawn upon first publication of this application or another application which 
claims benefit of priority to this application. 

Strains and plasmids 

20 -Aspergillus oryzae 44 and TK3 originate from the Nestle strain collection. 
However other wild type Aspergillus oryzae strains may also have been used in 
the context of the following examples. 
-A. oryzae NF1 derived from TK3 by targeted disruption (uridine auxotrophe). 
-Aspergillus nidulans 033 (bi Al, arg Al) can be obtained through Fungal Genetic 
25 Stock Center, Glasgow, and is used as a source of pyrG (GenBank accession 
number Ml 9132) gene. However other wild type Aspergillus nidulans strains 
may also have been used in the context of the following examples. 

- The Pichia pastoris (Invitrogen Inc., US) 

- Plasmid pMTL21-H4.6 containing the Aspergillus fumigatus dpplV gene can be 
30 provided by the Institut Pasteur, Paris, France (Beauvais et al, An homolog of 

the CD26 is secreted by the human pathogenic fungus Aspergillus fumigatus, 
Infect, immun. In press., 1997; GenBank EMBL, accession number: V87950). 

- Plasmid pNFF28 contains the A, oryzae TIG pyrG gene (GenBank EBI/UK, 
accession number: Y13811). 



35 - Plasmids pMTL20 (Chambers et al., Gene, £8, 139-149, 1988; GenBank EMBL, 
accession number: M21875), pNEB193 (Biolabs, New England) and 
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pBluescriptSK" (Stratagene, US) were used in subcloning procedures. 
-Plasmid pCL 1920b is a derivative of plasmid pCL1920 (Lerner and Inouye, 
Nucleic Acids Research, jj£, 4631, 1990) in which the multiple cloning site was 
modified to include a Smal site and a EcoRI site between the BamHl and Sail 



-The P. pastoris expression vector pKJ115 was constructed by cloning the 
expression cassette of pPIC9 (Invitrogen) in pCL 1920b. In pKJ115 the 
expression cassette of pPIC9 is flanked by two Smal sites for linearisation of the 
DNA, before transformation of P. pastoris. 



Growth mediifl 

- Aspergillus oryzae can grow on the minimal medium (MM) prepared according 
to Pontecorvo et al (Adv. Genet, 5, 141-239, 1953). 

15 - Aspergillus oryzae NF1 is grown at 35°C on MM containing 10 mM NaN0 3 as a 
nitrogen source and 10 mM uridine. 

- MMWG contains MM plus 1 % (w/v) of wheat gluten (WG) (Sigma), 

- MMWGH contains MM and 0.1 % (w/v) WG (Sigma) plus 0.1 % (w/v) WG 
hydrolysate prepared hydrolysing non-vital wheat gluten powder (Roquette, 

20 France) with Alcalase 2.4L (Novo Nordisk, Denmark). Hydrolysis is conducted 
at 20 % (w/w) substrate concentration and an enzyme to substrate ratio (E/S) of 
1:50 (by weight of protein) for 6 h at 60°C and constant pH of 7.5 (pH stat). 
Alcalase is then heat inactivated at 90°C for 10 min. After centrifugation of the 
hydrolysate, the supernatant is lyophilised to give WGH and stored at room 

25 temperature. WGH contains mainly peptides and only minimal amounts of free 
amino acids. Peptide mass distribution in WGH is from 200 to 10*000 Da, 
determined by size-exclusion chromatography on a Superdex Peptide column. 

- P. pastoris can grow on RDB. (Regeneration Dextrose Base): 1M sorbitol, 1 % 
(w/v) dextrose, 1.34 % (w/v) yeast nitrogen base (YNB), 4 x 10* 5 % (w/v) 

30 biotine, 5 x 10° % aa (i.e. 5 x 10" 3 % (w/v) of each L-glutamic acid,L- 
methionine, L-lysine, L-leucine and L-isoleucine. 

- MMM (Minimal Methanol Medium): 1.34 % (w/v) YNB, 4 x 10' 5 % (w/v) 
biotine, 0.5 % (w/v) methanol. 

- BMGY (Buffered minimal Glycerol-complex Medium): 1 % (w/v) yeast extract, 
35 2 % (w/v) peptone, 10 mM potassium phosphate pH 6.0, 1.34 % (w/v) YNB, 4 x 

10' 5 % (w/v) biotine, 1 % (w/v) glycerol. 



5 



sites. 



10 
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- BMMY : (Buffered minimal Methanol-complex Medium): 1 % (w/v) yeast 
extract, 2 % (w/v) peptone, 10 rnM potassium phosphate pH 6.0, 134 % YNB, 4 
x 10* 5 % (w/v) biotine, 0.5 (w/v) % methanol. 

5 Example 1 Cloning of the dpplV 

- Screening of a genomic library : a genomic DNA library was prepared using the 
DNA from A. oryzae 44 and screened with a DNA fragment containing the dpplV 
gene of Aspergillus fumigatus (Beauvais et al, GenBank EMBL, accession 

10 number: V87950). 

For this purpose, the isolation of the genomic DNA was performed according to a 
modified protocol of the method described by Raeder and Broda (Let. appl. 
Microbiol., I, 17-20, 1985). Mycelium was harvested by filtration, immediately 

1 5 frozen in liquid nitrogen and lyophilised. It was then grinded to a fine powder 
using a mortar and pestle. 200 mg of the powdered mycelium was resuspended in 
2.5 ml of extraction buffer (200 mM Tris-HCl pH 8.5 150 mM NaCl, 25 mM 
EDTA, 0.5 % SDS) and the solution was extracted with 1.75 ml extraction buffer- 
equilibrated phenol and 0.75 ml of chloroform/isoamylalcohol (24:1, v/v). The 

20 mixture was centrifuged (20 min, 3000 g). The aqueous phase was retrieved and 
incubated with 125 \x\ of RNAse A (Boehringer) solution (10 mg/ml) for 10 min at 
37°C. 1.25 ml of 2-propanol (Merck) were then added. The pellet was washed 
with 70 % ethanol and finally resuspended in 500 ml of TE buffer (10 mM Tris- 
HCl pH 8.0, 1 mM EDTA). 500 \i\ of 2 x QBT (1.5 M NaCl, 100 mM MOPS, 30 

25 % ethanol, pH 7.0) were added to the sample which was then applied to a 
"Genomic-tip" (Qiagen), rinsed and eluted as recommended by the supplier. 

The genomic DNA was then partially digested with Sau3A, and DNA fragments 
of 12-20 kb were isolated from low melting agarose (Biorad). These fragments 
30 were inserted into bacteriophages using the X EMBL3 BamHl arm cloning system 
(Promega, US). 

40000 recombinant plaques of the A. oryzae 44 genomic library in X EMBL3 were 
immobilised on nylon membranes (Genescreen, Dupont). These filters were 
35 probed, with the 32 P-labelled 2.3 kb dpplV insert of pMTL21-H4.6 amplified by 
PGR in a 5 x SSC solution containing 20 % formamide, 1 % sodium dodecyl 
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sulfate (SDS), and 10 % dextran sulfate at 42°C for 20 h. Labelling of DNA was 
performed using a random-primed DNA labelling kit (Boehringer) and (a 32 P)- 
dATP. The membranes were exposed to X-ray film after two 20 min washes in 3 x 
SSC-1 %SDS at40°C. 

5 

Ten positive clones were isolated and purified. Restriction enzyme analysis of 
purified bacteriophage DNA revealed that the clones carried similar but not 
identical DNA fragments. By Southern analysis, the dppYV gene was assigned to 
an Apal-EcoKV 4.8 kb fragment which was subcloned into pBluescriptSK", 
1 0 creating the plasmid pNFF 125. 

- Checking of functionalities: plasmid pNFF125 was introduced into A. oryzae 
NF1 by cotransformation with plasmid pNFF28, canying the pyrG gene for 
selection of transformants. 

15 

For this purpose, A, oryzae NF1 was grown overnight in MM with 50 mM 
glucose, 5 mM glutamine and 10 mM uridine. The mycelium was harvested by 
sterile over cheese cloth filtration, washed once with sterile double distilled water 
and once with K0.8MC (20 mM MES-HC1 pH 5.8, 0.8 M KC1, 50 mM CaCl2). 2 

20 g of mycelium were resuspended in 20 ml of a filter sterilised 5 mg/ml solution of 
Novozyme 234 in K0.8MC. The mycelium suspension was incubated at 30°C for 
2 hours with gentle agitation (120 rpm). The protoplasts were liberated from the 
mycelium by gentle resuspension with a pipette, washed twice with 20 ml of 
S1.0TC (10 mM Tris-HCl pH 7.5, 1 M Sorbitol, 50 mM CaCl2) and were 

25 resuspended at a final concentration of 10 8 /ml in S1.0TC. 20 ml of DNA was 
mixed with 200 ml of protoplasts and 50 ml of 25 % PEG 6000 (BDH) in 10 mM 
Tris-HCl pH 7.5, 50 mM CaCl2 and incubated for 20 min on ice. To this mixture, 
2 ml of 25 % PEG 6000 in 10 mM Tris-HCl pH 7.5, 50 mM CaCl2 were added, 
gently mixed and incubated for 5 min at room temperature. 4 ml of S1.0TC was 

30 . added and 1 .0 ml aliquots were mixed with 5 ml of 2 % low melting point agarose 
(Sigma) SMM (MM plus 50 mM glucose and 5 mM glutamine, osmotically 
stabilised with 1.0 M sucrose) and plated onto SMM agar (Difco). 

Ninety-five pyrG* transformants were screened for DPP IV activity after 
35 incubation (2 days, 30°C) on MMWGH. For this purpose, spores of transformants 
were resuspended in SP2 buffer (20 mM KH 2 P0 4 adjusted to pH 2.0 with HC1 and 
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0.9 % NaCl) in microtiter plates and replica plated onto Petri dishes containing 
MMWGH covered by a Whatman filter (Chrl). The plates were incubated for 2 
days at 30°C. DPP IV activity was detected on the filter according to Lojda 
(Histochemistry, 54, 299-309, 1977) and Aratake et al. (Am. J. Clin. Pathol., 26, 
5 306-310, 1991). Filters were reacted with a solution of 3 mg glycyl proline 4-(i 
naphthylamide (Bachem) in 0.25 ml N, N-dimethylfonnamide (Merck) and 5 mg 
o-dianisidine, tetrazotized (Sigma) in 4.6 ml 0.1 M sodium phosphate buffer pH 
7.2 for 10 min at room temperature. Endoproteolytic enzyme activity was also 
measured with resorufin-labeled casein according to Boehringer method 

10 description supplied with the substrate (Resorufin-labeled casein, Cat.No. 
1080733). Leucine aminopeptidase and dipeptidyl peptidase IV activities were 
determined by UV spectrometry with synthetic substrates Leu-pNa and Ala-Pro- 
pNa (Bachem, Switzerland), respectively, according to Sarath et al (Protease 
assay methods in Proteolytic enzymes: a practical approach, IRL Press, Oxford, 

15 1989). 10 mM substrate stock solution in dimethylsulfoxide (DMSO) was diluted 
with 100 mM sodium phosphate buffer, pH 7.0, to a final concentration of 0.5 
mM. 20-100 jal culture medium supernatant was added and reaction proceeded for 
up to 60 min at 37°C. A control with blank substrate and blank supernatant was 
done in parallel. The release of the chromophoric group 4-nitroaniline (e: 10'500 

20 M^cm" 1 ) was measured at 400 nm and activities were expressed as mU/ml 
(nmol/min/ml). 

Results show that sixteen transformants exhibited a clearly increased staining 
compared to the wild type. Seven transformants numbered 1 to 7 were selected 

25 because of their high DPP IV activity. Southern blots of them confirmed that the 
increase in the activity was due to the integration of multiple copies of the 4.8 kb 
ApahEcoKV fragment in the genome of the transformants. From densitometric 
scans of these Southern blots, it was estimated that in transformant 1 , at least 4 
additional copies had been functionally integrated into the genomic DNA, while, 

30 in transformant 6, they were at least 9 additional copies. 

To quantify the increase of DPP IV activity in the transformants 1 and 6, these 
were grown in parallel with control A. oryzae NF1 pyrG*, for 7 days at 30°C 
without shaking in 100 ml liquid MMWG. Analyses of the supernatants are shown 
35 in Table 1. Transformants 1 and 6 showed a DPP IV activity of at least 8 and 17 
times more, respectively, than A. oryzae NF1 pyrG* transformant, while their 
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leucine-aminopeptidase (LAP) and endopeptidase (ENDO) activities remain 
unchanged. These data strongly suggested that pNFF125 contained a functional 
dppTV gene. In addition, when a functional gene was introduced, the DPP IV 
activity increased more than it should have been compared to the number of 
5 integrated copies. The difference might also come from the titration of a 
negatively acting transcription factor (repressor). 



T abl e I 





DPP IV [mU/ml] 


LAP [mU/ml] 


ENDO [mU/ml] 


NFl pyrG + 


8.7 


1.6 


2.9 


Transformant 1 


73.9 


1.7 


3.3 


Transformant 6 


160.6 


1.9 


3.1 



10 - Characterisation of the DPP IV: culture broth from prolyl dipeptidyl peptidase 
overproducing transformant 6 and the control A, oryzae NFl pyrG* were analysed 
by SDS-PAGE. No single band in the prolyl dipeptidyl peptidase-overproducing 
strain stained more intensely than the A. oryzae NFl pyrG* control. However, a 
broad smear was visible in the region around 95 kDa of the prolyl dipeptidyl 

15 peptidase-overproducing strain, but not in the A. oryzae NFl pyrG* control. This 
aberrant electrophoretic behaviour might be caused by glycosylation of the 
enzyme. Therefore, culture broths were treated with N-glycosidase F and 
reanalysed. In the deglycosylated samples a band of 85 to 90 kDa appeared in the 
control NFl pyrG* and in the prolyl dipeptidyl peptidase overproducing 

20 transformant. A sample of the N-glycosidase F treated culture medium of 
transformant 6, corresponding to 100 mU prolyl dipeptidyl peptidase activity, was 
loaded onto a preparative gel and blotted onto an Immobilon P SQ membrane. The 
putative prolyl dipeptidyl peptidase band was excised and analysed by automated 
Edman degradation. The N-terminal sequence of the mature protein was 

25 determined to be Leu-Asp-Val-Pro-Arg-... . 

- Sequenci ng of the Ap al-EcoRV fragment: the 4.8 kb fragment from pNFF125 
was sequenced on both strands. The nucleotide sequence of the dppTV gene was 
determined, on a Li cor model 4000 automatic sequencer. IRD41 labelled primer 
30 having the nucleotide sequence SEQ ID NO:3 was used for sequencing both 
strands of partially overlapping subclones by the dideoxynucleotide method of 
Sanger et al (Proc. Natl. Acad. Sci. USA, 24, 5463-5467, 1977). The DNA 
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sequence analysis was performed by using the GCG Computer programs 
(Devereux et a/., Nucl. Acids Res., 12, 387-395, 1987). 

The position of transcription start sites were mapped by primer extension. 
5 Additionally the position of exons and intron were determined by RT-PCR. For 
this purpose, total RNA was isolated from the A. oryzae TK3 mycelia cultured 
overnight on MMWGH, using the "RNeasy Total RNA Purification kit" (Qiagen). 
Reverse transcriptase PCR (RT-PCR) was performed using the "1st strand cDNA 
synthesis kit for RT-PCR" (Boehringer). 10 \ig of total RNA, 1 x reaction buffer 

10 (10 mM Tris, 50 mM KC1 pH 8.3), 5 mM MgCl 2 , 1 mM deoxynucleotide mix, 1.6 
|ig oiigo-p(dT)i5 primer, 50 units RNAse inhibitor, 10 units AMV Reverse 
transcriptase were mixed and incubated 25°C 10 min, 42°C 60 min, 75°C 5 min 
and 4°C 5 min. 1 2 \x\ and 3 yd of the obtained cDNA, 2 mM of 
oligonucleotides and 250 mM dNTPs (Boehringer) were dissolved in 50 ml of 1 x 

15 PCR buffer (20 mM Tris-HCl pH 8.55, 16 mM (NH^SO^ 2.5 mM MgCl 2 , 150 
mg/ml BSA). To each reaction 1.5 unit of Taq-polymerase (Biotaq) were added as 
well as one drop of Nujol mineral oil (Perkin Elmer). The targeted region of the 
dppW gene was amplified, using a Stratagene Robo Cycler gradient 40, with the 
primer pair SEQ ID NO: 4 and SEQ ID NO:5. The reaction mixtures were 

20 subjected to 2 cycles of 1 min 98°C, 2 min 56°C and 2 min 72°C, followed by 27 
cycles of 1 min 94°C, 1 min 56°C and 2 min 72°C and 1 cycle of 1 min 94°C, 1 
min 56°C and 10 min 72°C. The gel purified PCR products were recovered with 
Qiaex II (Qiagen) and directly ligated into the pGEM-T vector (Promega) 
according to the instructions of the manufacturer, to generate plasmid pNFF137. 

25 

Results show that the open reading frame (ORF) is split by a 83 bp intron into 2 
exons. Furthermore, the 16 aa long N- terminal secretory signal sequence was 
identified by homology with the A.fumigatus sequence which corresponds well to 
the signal sequence rule described by Von Heijne (Nucleic Acids Res., 14, 4683- 

30 4690, 1986). The dppTV gene has the nuclotide sequence SEQ ID NO:l, and 
encodes a mature protein of 755 aa with a deduced molecular weight of 85.4 kDa 
(see SEQ ID NO:2). The signal sequence of dpplV runs from position 1835 
(ATG) to 1966 and includes the intron. The mature protein starts at position 1967 
with the amino acid sequence LeuAspValProArg as confirmed by Edman 

35 degradation. The exon 1 starts at position 1836 and ends at poistion 1841; intron 
starts at position 1842 and ends at poistion 1924; exon II starts at position 1925 
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and ends at position 423 1 . 

Example 3 Disruption of the dppIW gene 

5 In order to determine if the cloned dppYW gene was exclusively responsible for the 
DPP IV activity observed onto MMWGH, it was disrupted. 



As heterologous selection marker, to prevent targeting of the disrupting construct 
to the pyrG locus, the A. nidulans pyrG gene was amplified from A nidulans 033. 

10 To do so, the sequences between position 500 and 2342 of the pyrG gene (Oakley, 
et al., Gene, £1, 385-399, 1987) were amplified by PCR. 200 ng A. nidulans 033 
genomic DNA, 2 mM of oligonucleotides and 250 mM dNTPs (Boehringer) were 
dissolved in 50 ml of 1 x PCR buffer (20 mM Tris-HCl pH 8.55, 16 mM 
(NH 4 ) 2 S0 4 , 2.5 mM MgCl 2 , 150 mg/ml BSA). To each reaction 1.5 unit of Taq- 

15 polymerase (Biotaq) were added as well as one drop of Nujol mineral oil (Perkin 
Elmer). The targeted region was amplified, using a Stratagene Robo Cycler 
gradient 40, with the primer pairs SEQ ID NO:6 and SEQ ID NO: 7. The reaction 
mixtures were subjected to 30 cycles of 1 min 95°C, 1 min 52°C and 3 min 72°C. 
The gel purified 1 .8 kb PCR product was recovered with Qiaex II (Qiagen) and 

20 cloned into pGEM-T (Promega), according to the instructions of the manufacturer, 
to give pNFF39. 

In parallel, a mutant allele of dppW was generated from pNFF 125 by replacing 
the internal 1.5 kb Ncol fragment with the 1.8 kb Ncol fragment from pNFF39, 
25 creating pNFF129. 



Apal-EcoRV digested pNFF129 was introduced into A. oryzae NF1 and the 
transformants were grown on MM. Among 95 tested on MMWGH, eighteen 
transformants did not exhibit DPP IV activity. Six DPP IV negative transformants 

30 were selected and numbered from 8 to 13, and four transformants which still 
exhibited DPP IV activity were numbered from 14 to 17. A Southern blot of Ncol 
digested genomic DNA from these ten transformants was probed with the dpplV 
PCR fragment (see example 2). In transformants which did not exhibit DPP IV 
activity, the 1.5 kb Ncol fragment is absent, which proves that the wild type gene 

35 has been replaced by the disruption construct. In transformants which retain DPP 
IV activity, the 1.5 kb fragment is still present, and hybridising fragments with 
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other molecular weights show that the disruption construct has integrated at 
another site in the genome. 

To quantify DPP IV activity, transformants 10, 11 and 15 as well as A. oryzae 
5 NF1 pyrG + transformant were grown for 7 days at 30°C on liquid MMWG. 
Enzymatic analyses of the supernatant (table 2) showed that transformants 10 and 
1 1 had residual proline dipeptidyl-peptidase activity, probably due to some non 
specific enzymes. By contrast, transformant 15 had a higher DPP IV activity (at 
least 4 times more) compared to the wild type. Inspection of the original screen for 
10 DPP IV disruption mutant revealed additional clones with higher activity 
compared to the wild type. Since the disruption construct did not contain a 
functional gene, the increase of the activity might have been due to titration of a 
repressor. 

15 Tafrl<?2 





DPPIV[mU/ml] 


LAP [mU/ml] 


ENDO [mU/ml] 


NFl pyrG* 


8.7 


1.6 


2.9 


Transformant 10 


0.4 


5.8 


3.1 


Transformant 1 1 


0.1 


6.5 


4.5 


Transformant 15 


39.6 


5.7 


2.7 j 



Example 3 Expression of A. oryzae DPP IV in P. pastoris 

- Transformation of P. pastoris : plasmid pNFF125 was used as template for 
20 multiplying the dpp\V gene by PCR. To do so, 200 ng of pNFF125 DNA, 164 
pmol of oligonucleotides, 120 mM dNTP's were dissolved in 50 ml PCR buffer 
(20 mM Tris-HCl pH 8.8, 2 mM MgS0 4 , 10 mM (NH 4 ) 2 S0 4 , 0.1 % Triton X-100, 
100 mg/ml nuclease free BSA). A drop of dynawax (Dynazyme) was added. To 
each reaction 2.5 unit of cloned Pfu DNA polymerase (Stratagene) was added in 
25 50 ml of 1 x PCR buffer. The A. oryzae dppYW gene was amplified with the primer 
pair SEQ ID NO:8 and SEQ ID NO:9 (these primers covered N- and C- terminal 
mature protein coding region). The reaction mixtures were subjected to thirty 
cycles of 1 min 95°C, 1 min 44°C and 3 min 72°C using Perkin Elmer DNA 
Thermal Cycler. 

30 

The PCR product was digested by EcoKV and Noil and cloned into the SnaBl, 
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Notl digested pKJl 15, generating the plasmid pNFF134. P. pastoris sphaeroplasts 
were transformed with 10 \xg of pNFF134 linearised by EcoRl as described in the 
Manual Version 2.0. of the Pichia Expression Kit (Invitrogen). 

5 The P. pastoris expression cassette pKJl 15 can insert into the P. pastoris genome 
via homologous recombination at the alcohol oxidase (AOX1) site and carry, in 
addition to the cloned coding sequence of interest, the his4 gene for selection. 
Transformants were first selected on histidine-deficient media (RDB) and then 
screened for insertion of the construct at the aox\ site on minimal methanol plates 

10 (MMM). Transformants that were unable to grow on media containing only 
methanol as a carbon source (BMMY) were assumed to contain the construct in 
the correct yeast genomic location by integration events at the aoxl locus 
displacing of the aoxl coding region. The selected transformants were grown to 
near saturation (OD 20 at 600 nm) at 30°C in 10 ml of glycerol-based yeast media 

1 5 (BMGY). Cells were harvested and resuspended in 2 ml BMMY and incubated for 
2 days. After two days of incubation, the supernatant was harvested and 10 ml was 
analysed by SDS-PAGE according to the method of Laemmli (1970) with a 
separation gel of 7.5 % (w/v) polyacrylamide to identify succesfully expressing 
clones. In parallel, the supernatant was checked for activity. 

20 

Results show that the obtained concentration of DPP IV was 100 |ig/ml The 
activity measured in the supernatant was of about 1385 mU/ml. Among all the 
transformants, one was deposited under the Budapest Treaty at the Collection 
Nationale de Cultures de Microorganismes (CNCM), 25 rue du Docteur Roux, 
25 75724 Paris, France, on June , where it receives the deposit number CNCM 1-3. 

- Peptide profiling bv size exclusion chromatography (SEP: the efficiency of 
DPP IV towards peptides in WG hydrolysates was tested. Enzymes in the 
supernatant of dppW disruptant 1 1 thus were heat inactivated at 95 °C for 10 min. 

30 140 mU of purified DPP IV produced by P. pastoris CNCM 1-3 were added to 
500 ^1 of supernatant and incubated at 45°C up to 24 h. A control experiment 
without DPP IV addition was performed in parallel. Aliquots were taken at 2 h 
intervals, acidified with 10 % TFA, centrifuged and analysed by SEC on a 
Superdex Peptide HR 10/30 column (Pharmacia Biotech, Sweden). Separation is 

35 based on molecule size of amino acids and peptides (range: 1 00-7*000 Da).. 
Chromatography was performed under isocratic conditions with 0. 1 % TFA, 20 % 
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acetonitrile in water at a flow rate of 0.5 ml/min. Detection of amino acid and 
peptide peaks was at 215 nm. Peptide and amino acid standards were used to 
calibrate the chromatographic system (data not shown). 

5 Results show that an initial increase of small peptides (200-500 Da) can be 
detected already after 2 h incubation. Extended incubation (up to 24 h) releases 
more dipeptides. No changes are detected in the control sample at 2 h and 24 h 
incubation time. Therefore, it is clear that DPP IV activity liberates dipeptides 
from wheat gluten hydrolysates confirming the efficiency of this enzyme in 
1 0 peptide degradation 

Example 4 Transformation with the native promoter of dppW 

The plasmid pNFF126 containing the fragment of 2094 bp Apal-BarnHl 
1 5 encompassing the promotor region and the start of the DPP IV gene (see SEQ ID 
NO:l) was introduced into A. oryzae NF1, using pyrG gene as selection marker. 
The A. oryzae NF1 pyrG+ transformants were screened by staining for their 
prolyl-dipeptidyl-peptidase activity. Two transformants (B2, G7) showed a more 
intensive stain than the other ones. They were therefore cultured onto liquid 
20 MMWG for 7 days, 30 °C, without shaking, in parallel with three other randomly 
picked transformants and the control A. oryzae NF1 transformed with only pyrG. 

The prolyl-dipeptidyl-peptidase activity was analysed from the culture broths 
Results show that transformants B2 and C7 respectively showed a fourfold and 

25 twofold increase of the prolyl dipeptidyl peptidase activity compared to the 
control, whereas all the other ones do not exhibit any increase of this activity. In 
the disruption experiment (see example 2), also a maximum of fourfold increase 
of the prolyl-dipeptidyl-peptidase activity was noticed (transformant 15). This 
increase can be due to a repressor titrated by the multicopies of the promotor 

30 region integrated heterologously in the genome of A. oryzae NF1 or by a positive 
acting factor encoded by the 2094 bp Apal-BamHI fragment. 

Example 5 Functional derivatives of the DPPIV 



35 Functional derivatives of the DPP IV (SEQ ID NO:2) are prepared according to a 
method adapted from the method described by Adams et al. (EP402450; 
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Genencor). Briefly, the expression cassette pKJ115 containing the DPP IV was 
subjected to an in-vitro chemical mutagenesis by hydroxylamine. According to 
example 3, the mutagenised DNA was then used to transform P. pastoris. 
Functional derivatives of the DPP IV, presenting a deletion, addition and/or a 
5 substitution of some amino acids, were finaly detected according to their peptide 
profile obtained by hydrolysing wheat gluten with purified DPP IV derivatives 
(see example 3). 

Examples 6 

10 

For preparing a fermented soya sauce, a koji is prepared by mixing an Aspergillus 
oryzae CNCM 1-1 koji culture with a mixture of cooked soya and roasted wheat, 
the koji is then hydrolysed in aqueous suspension for 3 to 8 hours at 45°C to 60°C 
with the enzymes produced during fermentation of the Aspergillus oryzae CNCM 
15 1-1 culture, a moromi is further prepared by adding suitable amount of sodium 
chloride to the hydrolysed koji suspension, the moromi is left to ferment and is 
then pressed and the liquor obtained is pasteurized and clarified. 

Example? 7 

20 

For producing a flavouring agent, a aqueous suspension of a mixture of cooked 
soya and roasted wheat is prepared, the proteins are solubilized by hydrolysis of 
the suspension with a protease at pH6.0 to 1 1 .0, the suspension is heat-trated at pH 
4.6 to 6.5, and the suspension is ripened with enzymes of a koji culture fermented 
25 by Aspergillus oryzae CNCM 1-2. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 
(i) APPLICANT: 

(A) NAME: SOCIETE DES PRODUITS NESTLE 

(B) STREET: AV. NESTLE 55 

(C) CITY: VEVEY 

(D) STATE: VAUD 

(E) COUNTRY: SWITZERLAND 

(F) POSTAL CODE (ZIP) : CH-1550 

(ii) TITLE OF INVENTION: CLONING OF THE PROLYL -DIPEPTIDYL- PEPTIDASE 

OF ASPERGILLUS ORYZAE 
(iii) NUMBER OF SEQUENCES: 9 
(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 96 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/ KEY: exon 

(B) LOCATION: 1836.. 1841 
(ix) FEATURE: 

(A) NAME/ KEY: exon 

(B) LOCATION: 1925.. 4231 
(ix) FEATURE: 

(A) NAME/ KEY: intron 

(B) LOCATION: 1842.. 1924 
(ix) FEATURE: 

(A) NAME/ KEY: sigjpeptide 

(B) LOCATION: 1836.. 1841 
(ix) FEATURE: 

(A) NAME/ KEY : sig_peptide 

(B) LOCATION: 1925.. 1967 
(ix) FEATURE: 

(A) NAME/KEY: promoter 

(B) LOCATION: 1..1835 
(ix) FEATURE: 

(A) NAME/ KEY: terminator 

(B) LOCATION: 4232.. 4771 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 

GGGCCCTGAG TTTAACGGTG CTGGGTGTGT TATTACGCAT CATACTCTTC ACCCGCCTTG 60 

CAGTAGTTCG GTTCTATTGT CAATAGCTGC TGTCGCAATA TTCTGTCTTT TGCCAATAAG 12 0 

GTGACCAGGA GGGGTCTTTC CAGGATAGAT AGATGGCGAC ATTTATCTCG TCGCGGCGGT 180 

GATTGTCTGT TTGATTGATG ATGATCTCTG AAACATGTTG AATCTGGGGT ACGTAACTTG 240 

GGGTGATCAA TTGACATCCA CTTAGATATG GTACAGCAAA GTATACCTCC TGGATTCTGT 300 
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/— I 7\ -TV ^1 IV TV /~1 TV TV m 

QAACAAGAAT 


TV m TV TV TV TV m TV TV i—> 

ATAAAATAAG 


CCTCGCGACC 


GGGAGTCTTG 


TCCCTCAAAT 


CATCACAATC 


360 




C C ATCGAACA 


TCCGCATCTA 


ATTTCCTCAC 


TCATCCTTCT 


ATCCACCGCC 


AAAATGAAGG 


420 




CCGCTACCCT 


CCTCTCTCTT 


PTPUPPpfTH 

CTGAGCGTTA 


CCGGACTCGT 


CGCCGCTGCT 


CCAGCTGGCA 


480 




TV PPPT1\ fVP 7\ 

ACGGTACGTA 


TCCTGAACGA 


f-% TV TV fYV "I*7V. TV ^"t TV 

CAATGTAAGA 


CG CTTG ACTG 


ATGATTAGTA 


GGCCCAGCTG 


540 




ftn^r^* 7\ 7\ T»/*t T\ rri 

G TGGAATCAT 


CGACCGCGAT 


CTTCCCGTCC 


CTGTCCCTGG 


ACTCCCTACC 


AAGGGTCTCC 


600 




CTATTGTTGA 


CGGATTGACT 


GGCGGCAATA 


TV r*y /™i /~i y^i ^ tv 

AGGGTGGCGA 


GAAGCCTGGA 


AGCAAGGTTA 


660 




CTCCTCGTGA 


tv mv no^imTi 

AGACCCTACC 


GGCAGCGCCC 


CTGATGGCAA 


GGGCAATGAT 


GGCCCCGACG 


720 




rimn t\ m/lmm tv /"» 

GTGATCTTAC 


CGGACGTCCC 


GGTCAAGGGG 


GTCTTGACAA 


CCCTTTCGAT 


CTCCCTACTC 


780 




CAGAGCTTCC 


TCCCGTCAAG 


CTTCCTGGCG 


GACTTGACGG 


TGGCAAGGGC 


GGTCTCGGCC 


840 


on 


TTCGTCGTCG 


m/^/^^i'TX ^T /"l ^T })v 

TGGCAGCCCA 


GTAGACGGTC 


TCCCTGTCGT 


TGGGCCTGTT 


GTTGGTGGTG 


900 




TTCTAGGTGG 


CGGTGGTGCT 


GGCAGTGGTG 


CTGGTGCCAA 


GGGTGGTGCT 


GGTAGTGGTA 


960 




CCGTTGGGCG 


TCGTGGCAGC 


/T /T TV m TV T TV jr^ 

CCAGTAGACG 


GTCTCCCTGT 


TGTTGGGCCT 


GTTGTTGGTG 


1020 




GTGTCCTAGG 


TGGCGGTGGT 


GCTGGCAGTG 


GTGCTGGTGC 


CAAGGGTGGT 


GCTGGTAGTG 


1080 




GTACCCCTAA 


GCGCCGTGAC 


GGTCCAGTGG 


ACGGTGTTCC 


TGTCGTTGGA 


GAGCTTGCTG 


1140 


on 
oU 


AAGGTGCTAC 


TGGAGGTCTT 


CTAGGTGGTG 


ATGCTGGTTC 


TGCTGATGCT 


GCTGGTGCTG 


1200 




ATGCTGGTGC 


TGATGCTGGT 


GCTGGTGCTG 


GTGGGCAATA 


GTCTAACAAG 


GGCTTTACGG 


1260 




CATCAATGTG 


AGGTTATCCA 


ACATCCATCC 


TTGGTGGCCA 


TTCGTAAATA 


GCAACAAAGA 


1320 




GGGGTGGTAC 


TTGGTCGCGA 


TGTCATTGCT 


CCTGCGATTG 


AAGCTAGCGA 


TTCCTGTATG 


1380 




tv * tv m t\ t\ 1 1 w ■ * 

TACAATAATT 


TTAAG C ACG C 


TTGGTTCCAT 


ACTGTTTCTT 


CACTGGTTTT 


TGGATATTTT 


1440 


ACi 
*t\J 


TTCACTTATT ' 


^ TV TV m^imm^im^ 

GAATCTTGTA 


GTAGTCCAGC 


TTCTCATGGT 


TAGACACGGG 


ATAACCCCCC 


1500 




Ti t\fT7lv /init mATi 

AATAGCATCA 


TCTGCAGGTT 


TGATGTTGCA ATGGTCAAGT 


TTTGTCTTAA 


ATTATGTACG 


1560 




AGTCTTGGGT 


TACCCCGCTA 


GAAGCTTTGC 


CACCAATGAA 


GCTGTTGCTT 


GTCCAACGGC 


1620 




TATCAGCGGT 


TTTTTTTATG 


AGAATCTTGG 


CAGGATAGGA 


AAAGTTGGTG 


GTGGTGAAGG 


1680 




AGCTAATGCA 


GGAGGTGGAG 


TGACTGATAA 


GACGCGATTT 


CTGCGGGGAA 


AAAGAAAAAG 


1740 




GACCAATTTA 


TGGGACTATT 


TATTTAAACG 


GGAAGTCTTC 


AATTCCGTTC 


GCCAGCCATC 


1800 




■ CCTTGATTCG 


AG CTG AAC TC 


GGGGTTTTTT 


CCACCATGAA 


/■I APTI1V /**1/"1IT^J**1 TV TV 

GGTACGTCAA 


TTCCACTGAT 


1860 




TAAACATTAT 


TTGTTACATA 


CACTCCATCA 


TTGAGTCAAT 


m tv m tv Tv mm tv Tv /"i 

TATAATTAAC 


ACCTCATAAT 


1920 




TCAGTACTCC 


AAGCTTCTGC 


TGCTCCTGGT 


CAGTGTGGTC 


CAGGCCCTGG 


ATGTGCCTCG 


1980 




P7\TVAPPRPAP 

G AAAC C AC AC 


GCGCCCACCG 


GAGAAGGCAG 


TAAGCGTCTC 


TV ^1 / ff f If Il^i TV IV fTT^t 

ACCTTCAATG 


AGACCGTAGT 


2040 


60 


CAAGCAAGCA 


ATTACGCCGA 


CCTCTCGCTC 


GGTGCAATGG 


CTCTCGGGCG 


CAGAGGATGG 


2100 




ATCCCTACGT 


GTACGCGGCG 


GAAGACGGCA 


GTCTCACCAT 


CGAGAACATC 


GTCACCAACG 


2160 




AGTCACGCAC 


GCTCATCCTG 


CGGACAAGAT 


TCCGACAGGG 


AAGGAAGCGT 


TCAATTACTG 


2220 




GATCCATCCC 


GACTTGTCGT 


CGGTGCTGTG 


GGCGTCCAAC 


CACACCAAGC 


AGTATCGGCA 


2280 




TTCGTTCTTT 


GCCGATTATT 


ACGTCCAGGA 


TGTGGAGTCA 


CTCAAGTCCG 


TGCCCCTGAT 


2340 


70 


GCCCGATCAG 


GAAGGTGATA 


TTCAATATGC 


CCAATGGAGC 


CCCGTGGGCA 


ATACCATCGC 


2400 




TTTTGTTCGC 


GAGAATGACC 


TTTATGTCTG 


GGATAATGGT 


ACCGTTACTC 


GCATTACTGA 


2460 
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TGATGGTGGC 


C CCG AC ATGT 


TCC ACGG CGT 


GCCGGACTGG 


ATCTATGAAG 


AGGAGATCCT 


2520 




CGGCGATCGC 


TACGCGTTGT 


GGTTCTCGCC 


AGATGGTGAA 


TATCTGGCTT 


ACTTGAGCTT 


2580 




CAATGAGACT 


GGGGTTCCGA 


CCTACACCGT 


TCAGTATTAT 


ATGG ATAAC C 


AAGAGATCGC 


2640 




TCCGGCGTAT 


GCATGGGAGC 


TGAAGATAAG 


GTATCCCAAG 


GTGTCGCAGA 


CGAATCCGAC 


2700 


I u 


CGTGACGTTG 


AGTCTGCTTA 


ACATCGCTAG 


CAAGGAGGTG 


AAGCAGGCGC 


CGATCGACGC 


2760 




GTTCGAGTCA 


ACTGACTTGA 


TCATTGGCGA 


GGTTGCTTGG 


CTCACTGATA 


CTCACACCAC 


2820 




CGTTGCTGCT 


AAGGCGTTCA 


ACCGTGTCCA 


GGACCAGCAA 


AAGGTCGTCG 


CGGTCGATAC 


2880 




TGCCTCGAAC 


AAGGCTACTG 


TCATCAGCGA 


CCGAGATGGG 


ACCGATGGAT 


GGCTCGATAA 


294 0 




CCTTCTTTCA ATGAAGTATA 


TTGGCCCTAT 


CAAGCCGTCC 


GACAAGGATG 


CCTACTACAT 


3000 


on 


CGACATCTCT 


GACCATTCGG 


GATGGGCGCA TCTGTATCTC 


TTCCCCGTTT 


CGGGCGGCGA 


3060 




ACCTATCCCA 


CTAACCAAAG 


GCGACTGGGA GGTCACGTCT ATTCTGAGTA 


TTGATCAGGA 


3120 




ACGCCAGTTG 


GTGTACTACC 


TGTCGACTCA ACACCACAGC 


ACCGAGCGCC 


ATCTCTACTC 


3180 




CGTCTCCTAT TCCACGTTTG CGGTCACCCC GCTCGTCGAC GACACCGTTG 


CCGCGTACTG 


3240 




GTCTGCTTCC 


TTCTCCGCGA 


ACTCGGGCTA 


CTACATCCTC 


ACATACGGAG 


GCCCAGACGT 


3300 


OU 


ACCCTACCAG 


GAACTCTACA 


CGACCAACAG 


TACCAAACCA 


CTCCGCACAA 


TCACCGACAA 


3360 




CGCCAAAGTA 


CTCGAGCAAA 


TCAAGGACTA 


TGCATTGCCC 


AACATCACCT 


ACTTCGAGCT 


3420 




TCCCCTCCCC 


TCCGGAGAAA 


CCCTCAATGT 


GATGCAGCGC 


TTACCCCCCG 


GGTTCTCCCC 


3480 




GGATAAGAAG 


TACCCCATAC 


TTTTCACCCC 


ATACGGCGGC 


CCAGGCGCCC 


AAGAAGTGAC 


3540 




CAAGAGATGG 


CAAGCCCTGA 


ATTTCAAGGC 


CTATGTCGCC 


TCCGACAGCG 


AACTCGAGTA 


3600 


AO 


CGTAACCTGG 


ACTGTCGACA 


ACCGCGGCAC 


AGGTTTCAAA 


GGACGCAAGT 


TCCGCTCCGC 


3660 




CGTCACGCGC 


CAACTCGGCC 


TCCTCGAAGC 


AGAAGACCAG 


ATCTACGCCG 


CGCAACAGGC 


3720 




GGCCAACATC 


CCCTGGATCG 


ATGCAGACCA 


CATCGGCATC 


TGGGGCTGGA 


GTTTCGGAGG 


3780 




CTACTTGACC 


AGCAAGGTCC 


TGGAGAAGGA 


CAGCGGTGCT 


TTCACATTAG 


GAGTCATCAC 


3840 




CGCCCCTGTT TCTGACTGGC GTTTCTACGA CTCAATGTAC ACGGAGCGCT ACATGAAGAC 


3900 


OU 


CCTCTCGACC 


AATGAGGAGG 


GCTACGAGAC 


CAGCGCCGTC 


CGCAAGACTG 


ACGGGTTCAA 


3960 




GAACGTCGAG 


GGCGGATTCT 


TGATCCAGCA 


CGGAACGGGC 


GACGATAACG 


TCCATTTCCA 


4020 




GAACTCGGCT GCGCTGGTGG 


ATCTCCTGAT 


GGGCGATGGC 


GTCTCTCCTG 


AGAAGCTCCA 


4080 




TTCGCAATGG 


TTCACAGACT 


CAGACCACGG 


AATCAGCTAC 


CATGGTGGCG 


GCGTGTTCCT 


414 0 




GTACAAGCAA 


CTGGCCCGGA 


AGCTCTACCA 


GGAGAAGAAC 


CGACAGACGC 


AGGTGCTGAT 


4200 


60 


GCACCAGTGG 


ACTAAGAAGG 


ACTTGGAGGA 


GTAGAAGCGG 


CACATCATTC 


ATTCATTTTA 


4260 




AAGCGACTGG 


CTACACATAG 


CATACATAGC 


AATTGATACT 


TCGTATTTTA 


CCCTCCCCAC 


4320 




AGCCACGACC 


ATCACCCATT 


GGCGCAAAAT 


TCTCCCCGCA 


CCATAAACTA 


GCGCGACGAG 


4380 




GCTGAAAATC 


TGCCAGAAAT 


CTACTTAAAG 


CTCGTGTTGG 


CCCAGTCCCT 


CACAACCCAA 


4440 




ACCATCCCAA 


GTAAACAAAA 


CCAAAAAAAA 


ATCCCATAGA 


AAATGGCCGA 


CATCCCCACC 


4 500 


70 


TCAACAGTCC 


AAATCACAAC 


CCTCCCCACC 


AAATCCGTAA 


CAATCACCCC 


GCAACGAGCG 


4560 




ACCATCGTTC 


GCGAGATACA 


CACCTCCATC 


CAGGTATGCA 


CATACCACCT 


CACCTGACCA 


4620 
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TCCAACCCTA CTTACAGTCA ACGTAAACTA ACAAAATTAA AAAAATAAAA AGACAGGCCA 4680 

ACACGAACTA ATAATCACCG GCCTCGACCC AAGAGTAGAC ACCGACTCCA TTTTACTCGA 4 74 0 

AGGAACAGGA ACGGCCACAA TAACCGATAT CCAAACCTCG ATAGTCCCCC GACAGGAAAA 4800 

ATTCGAGGAT ATCTATCCCG CCGAATCAGA CTCCGACGAC TCCCCAGAGC CCGATTCCGA 4 860 

10 CTCCGACCTT GACCACGATG ACCCCGAGTT ACAAGCTATC TCCGCATCCA TAGCCGAAGT 4 920 

CGAAGCGCGA CTTGCGCGAG CGGAAAATGA ACAGACGATG GCGGTTTCCA TCCGGGAGTT 4 980 

TCTGGATGGG TATGCCAAGA AGATGGATCC GGAGCATGTG GACGCGGAGA TGCTAGATGG 504 0 

GTTCTTGGGG CTTTATACCC GGCAGCGGGT GGAGGGGTTT CAGCGGCATC ATCAGGCTGG 5100 

GGTGGAGTAT GGGAAGGGGG AGAGGGAGCT TGCGCGGTTG GTGAAGAGGA ACGCGGGAAG 5160 

20 ATTGAGGGTC GGTTGAAGAG GGCTAGGGAG GTGGTGAAGA AGAAGGAGCG GAGGGAGAGG 5220 

GAGAAGAGAG CCACCGAGCG TGCGAGGAAG ACTGAACAGC GGAAGATGAA GAGGGAGGAG 5260 

AGACTCAAGT TCTGGACGAC GCGGGTTGGG CAGGTGGTTG TGTCATCTGG ATAGTCAGGC 5340 

CGGGACTWGC CGGCGCAGTT CCATCGTTGA ATCGGGTTGA ACGGTTKTCT GGTTGTGTGT 54 00 

AGTATTTCAT GCGGAGCCTG TGTGGATGTC GACGTGTGCG TGCTGAGACT ATGTTGTGTA 54 60 

30 CGWMTATAGA TTTAATTAAG GATCCKGCGT GCCGCC 54 96 



(2) • INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 771 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : protein 
(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 1 . . 16 

(D) OTHER INFORMATION: /labels signal -peptide 
(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 17.. 771 

(D) OTHER INFORMATION: /label = secreted-enzyme 

/note « "enzyme providing a 

prolyl -pept idyl -peptidase 
activity" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

Met Lys Tyr Ser Lys Leu Leu Leu Leu Leu Val Ser Val Val Gin Ala 

-15 -10 -5 

Leu Asp Val Pro Arg Lys Pro His Ala Pro Thr Gly Glu Gly Ser Lys 
1 5 10 15 

Arg Leu Thr Phe Asn Glu Thr Val Val Lys Gin Ala lie Thr Pro Thr 

20 25 30 

Ser Arg Ser Val Gin Trp Leu Ser Gly Ala Glu Asp Gly Ser Leu Arg 

35 40 45 

Val Arg Gly Gly Arg Arg Gin Ser His His Arg Glu His Arg His Gin 

50 55 60 

Arg Val Thr His Ala His Pro Ala Asp Lys lie Pro Thr Gly Lys Glu 
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65 70 75 80 

Ala Phe Asn Tyr Trp lie His Pro Asp Leu Ser Ser Val Leu Trp Ala 

85 90 95 

Ser Asn His Thr Lys Gin Tyr Arg His Ser Phe Phe Ala Asp Tyr Tyr 

100 105 110 

Val Gin Asp Val Glu Ser Leu Lys Ser Val Pro Leu Met Pro Asp Gin 

115 120 125 

Glu Gly Asp lie Gin Tyr Ala Gin Trp Ser Pro Val Gly Asn Thr lie 

130 135 140 

Ala Phe Val Arg Glu Asn Asp Leu Tyr Val Trp Asp Asn Gly Thr Val 
145 ~ 150 155 160 

Thr Arg lie Thr Asp Asp Gly Gly Pro Asp Met Phe His Gly Val Pro 

165 170 175 

Asp Trp lie Tyr Glu Glu Glu lie Leu Gly Asp Arg Tyr Ala Leu Trp 

180 185 190 

Phe Ser Pro Asp Gly Glu Tyr Leu Ala Tyr Leu Ser Phe Asn Glu Thr 

195 200 205 

Gly Val Pro Thr Tyr Thr Val Gin Tyr Tyr Met Asp Asn Gin Glu He 

210 215 220 

Ala Pro Ala Tyr Pro Trp Glu Leu Lys He Arg Tyr Pro Lys Val Ser 
225 230 235 240 

Gin Thr Asn Pro Thr Val Thr Leu Ser Leu Leu Asn He Ala Ser Lys 

245 250 255 

Glu Val Lys Gin Ala Pro He Asp Ala Phe Glu Ser Thr Asp Leu lie 

260 265 270 

He Gly Glu Val Ala Trp Leu Thr Asp Thr His Thr Thr Val Ala Ala 

275 280 285 

Lys Ala Phe Asn Arg Val Gin Asp Gin Gin Lys Val Val Ala Val Asp 

290 295 300 

Thr Ala Ser Asn Lys Ala Thr Val lie Ser Asp Arg Asp Gly Thr Asp 
305 310 315 * * 320 

Gly Trp Leu Asp Asn Leu Leu Ser Met Lys Tyr lie Gly Pro lie Lys 

325 330 335 

Pro Ser Asp Lys Asp Ala Tyr Tyr lie Asp lie Ser Asp His Ser Gly 

340 345 350 

Trp Ala His Leu Tyr Leu Phe Pro Val Ser Gly Gly Glu Pro lie Pro 

355 360 " 365 

Leu Thr Lys Gly Asp Trp Glu Val Thr Ser lie Leu- Ser He Asp Gin 

370 375 380 

Glu Arg Gin Leu Val Tyr Tyr Leu Ser Thr Gin His His Ser Thr Glu 
385 390 395 400 

Arg His Leu Tyr Ser Val Ser Tyr Ser Thr Phe Ala Val Thr Pro Leu 

405 410 415 

Val Asp Asp Thr Val Ala Ala Tyr Trp Ser Ala Ser Phe Ser Ala Asn 

420 425 430 

Ser Gly Tyr Tyr He Leu Thr Tyr Gly Gly Pro Asp Val Pro Tyr Gin 

435 440 445 

Glu Leu Tyr Thr Thr Asn Ser Thr Lys Pro Leu Arg Thr lie Thr Asp 

450 455 460 

Asn Ala Lys Val Leu Glu Gin lie Lys Asp Tyr Ala Leu Pro Asn lie 
465' 470 475 480 

Thr Tyr Phe Glu Leu Pro Leu Pro Ser Gly Glu Thr Leu Asn Val Met 

485 490 495 

Gin Arg Leu Pro Pro Gly Phe Ser Pro Asp Lys Lys Tyr Pro lie Leu 

500 505 510 

Phe Thr Pro Tyr Gly Gly Pro Gly Ala Gin Glu Val Thr Lys Arg Trp 

515 520 525 

Gin Ala Leu Asn Phe Lys Ala Tyr Val Ala Ser Asp Ser Glu Leu Glu 

530 535 540 

Tyr Val Thr Trp Thr Val Asp Asn Arg Gly Thr Gly Phe Lys Gly Arg 
545 550 555 560 

Lys Phe Arg Ser Ala Val Thr Arg Gin Leu Gly Leu Leu Glu Ala Glu 

565 570 575 

Asp Gin He Tyr Ala Ala Gin Gin Ala Ala Asn lie Pro Trp lie Asp 

580 585 590 

Ala Asp His lie Gly lie Trp Gly Trp Ser Phe Gly Gly Tyr Leu Thr 

595 600 605 

Ser Lys Val Leu Glu Lys Asp Ser Gly Ala Phe Thr Leu Gly Val lie 

610 615 620 

Thr Ala Pro Val Ser Asp Trp Arg Phe Tyr Asp Ser Met Tyr Thr Glu 
625 630 635 640 
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Arg 


Tyr 


Met 


Lys 


Thr 


Leu 


Ser 


Thr Asn Glu Glu Gly Tyr Glu Thr Ser 








645 






650 655 


Ala 


Val 


Arg 


Lys 
660 


Thr 


Asp 


Gly 


Phe Lys Asn Val Glu Gly Gly Phe Leu 
665 670 


He 


Gin 


His 


Gly 


Thr 


Gly 


Asp 


Asp Asn Val His Phe Gin Asn Ser Ala 






675 




680 685 


Ala 


Leu 


Val 


Asp 


Leu 


Leu 


Met 


Gly Asp Gly Val Ser Pro Glu Lys Leu 




690 








695 


700 


His 


Ser 


Gin 


Trp 


Phe 


Thr 


Asp 


Ser Asp His Gly lie Ser Tyr His Gly 


705 










710 




715 720 


Gly 


Gly 


Val 


Phe 


Leu 


Tyr 


Lys 


Gin Leu Ala Arg Lys Leu Tyr Gin Glu 








725 




730 735 


Lys 


Asn 


Arg 


Gin 
740 


Thr 


Gin 


Val 


Leu Met His Gin Trp Thr Lys Lys Asp 
745 750 


Leu 


Glu 


Glu 
755 













20 (2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear. 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 

GCCTGGACCA CACTGACC 18 

30 



(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single . 

(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4: 

TCCACCATGA AGTACTCC 18 



(2) INFORMATION FOR SEQ ID NO : 5: 
(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATCGCCGAGG ATCTCCTC 18 



60 
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(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6: 

GAATTCCATG GTGTCCTCGT CGG 23 



(2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

GAATTCGAGC CGTCAGTGAG GCTC 24 



(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8: 

TGGTCGATAT CCTGGATGTG CCTCGGAAAC CA 32 



(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9: 



TTGCGGCCGC TACTCCTCCA AGTCCTTCTT 
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1. A recombinant prolyl-dipeptidyl-peptidase (DPP IV) from Aspergillus oryzae 
comprising the amino-acid sequence from amino acid 1 to amino acid 755 of SEQ 

5 ID NO:2 or functional derivatives thereof 

2. A recombinant prolyl-dipeptidyl-peptidase according to claim 1 which is fused 
to a leader peptide. 

10 3. A recombinant prolyl-dipeptidyl-peptidase according to claim 2 which is fused 
to the leader peptide of Aspergillus oryzae having the amino-acid sequence from 
amino acid -16 to amino acid -1 of SEQ ID NO:2 or functional derivatives thereof. 

4. A leader peptide of Aspergillus oryzae having the amino-acid sequence from 
1 5 amino acid -1 6 to amino acid -1 of SEQ ID NO:2 or functional derivatives thereof. 

5. A DNA molecule which comprises a dpplV gene encoding the enzyme 
according to claim 1 . 

20 6. A DNA molecule according to claim 5, which is a vector comprising the dpplV 
gene. 

7. A DNA molecule according to claim 5, wherein the dpplV gene is operably 
linked to at least one regulatory sequence able to direct the expression of the gene. 

25 

8. A DNA molecule according to claim 7, wherein the regulatory sequence is - 
derived from another organism than the one from which the dppYV gene is 
derived. 

30 9. A DNA molecule according to claim 5, wherein the dppYV gene comprises the 
coding parts of the nucleotide sequence SEQ ID NO: 1 or functional derivatives 
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thereof due to the degeneracy of the genetic code. 

10. A cell which expresses the enzyme according to claims 1-4 by recombinant 
technology. 

11. A cell according to claim 10, which is Pichia pastoris CNCM 1-1886. 

12. A cell according to claim 10 which is able to over-express the enzyme. 

13. A cell according to claim 12, which is an Aspergillus oryzae capable of 
providing at least 50 mU of prolyl-dipeptidyl-peptidase activity per ml of 
supernatant when grown in a minimal medium containing 1% (w/v) of wheat 
gluten. 

14. An Aspergillus oryzae according to claim 12, wherein it has integrated 
multiple recombinant functional dppTW genes according to claims 5 to 9. 

15. An Aspergillus oryzae according to claim 14 which is the Aspergillus oryzae 
CNCM 1-1888. 

16. An Aspergillus naturally providing a prolyl-dipeptidyl-peptidase activity 
which has integrated multiple copies of the Aspergillus native promoter which 
naturally directs the expression of the gene encoding the prolyl-dipeptidyl- 
peptidase activity. 

17. An Aspergillus according to claim 16, which has integrated multiple copies of 
the promoter contained in the nucleotide sequence SEQ ID NO:l. 

18. An Aspergillus according to claim 17, which has integrated multiple copies of 
the promoter having the coding nucleotide sequence from nucleotide 1836 to 
nucleotide 1966 of SEQ ID NO;l. 

19. A Aspergillus oryzae according to claim 18, which is the Aspergillus oryzae 
CNCM 1-1887. 

20. An Aspergillus naturally providing a prolyl-dipeptidyl-peptidase activity 
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which is manipulated genetically so that the dpplV gene is inactivated. 

21. A method for producing the enzyme according to claim 1, comprising 
cultivating recombinant cells according to claims 10-19 in a suitable growth 

5 medium under conditions that the cells express the enzyme, and optionally isola- 
ting the enzyme in the form of a concentrate. 

22. Use of the protein according to claim lor the cells according to claims 10-19 
to hydrolyse protein containing materials. 

10 

23. Use of an enzyme and/or a microorganism providing a prolyl-dipeptidyl- 
peptidase activity, in combination with at least an enzyme providing a prolidase 
activity, to hydrolyse protein containing materials. 

15 24. A food product comprising a protein hydrolysate obtainable by fermentation 
of protein containing materials with at least a microoganism providing a prolyl- 
dipeptidyl-peptidase activity higher than 50 mU per ml when grown in a minimal 
medium containing 1 % (w/v) of wheat gluten. 
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